Search Result Clustering Method at NTCIR-5 Web Query Expansion Subtask

نویسندگان

  • Hiroyuki Toda
  • Ryoji Kataoka
چکیده

We use a retrieval system with search result clustering to tackle the NTCIR-5 WEB Query Term Expansion Subtask. The system clusters the search results in such a way as to make it easier for the user to select relevant documents as feedback documents. In addition, we select phrase words or named entities(NE) as query-expansion keywords from the feedback documents because these words tend to represent the characteristics of feedback documents and can retrieve relevant documents that were not retrieved by the initial keywords. Based on our evaluations, we report the efficiency of keyword expansion and the number of relevant documents in the feedback documents.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

RUCIR at NTCIR-12 IMINE-2 Task

In this paper, we present our participation in the Query Understanding subtask and the Vertical Incorporating subtask of the NTCIR-12 IMine-2 task, for both English and Chinese topics. In the Query Understanding subtask, we combine the extracted candidates from search engine suggestions and Wikipeida, and classify their verticals after clustering and ranking them. In the Vertical Incorporating ...

متن کامل

Search Intent Mining by Word Vectors Clustering at NTCIR-IMine

This paper presents a method for intent mining based on semantic vectors and search results clustering. Our algorithm represent words as documents and performs a state-of-theart approach for query log driven clustering. Similarities between query logs and words are calculated by using semantic vectors. Based on a manual selection of vertical representatives, our method is able to correctly iden...

متن کامل

HITSZ-ICRC at NTCIR-11 Temporalia Task

* Corresponding Author ABSTRACT Temporal Information Access (Temporalia) task is a pilot task at NTCIR-11 for the first year. HITSZ-ICRC group participated in Temporalia task, worked in both Temporal Query Intent Classification (TQIC) subtask and Temporal Information Retrieval (TIR) subtask. In TQIC subtask, firstly, we extracted different linguistic level features from user query, extracted ex...

متن کامل

Overview of the NTCIR-5 WEB Query Term Expansion Subtask

The query term expansion subtask was conducted to establish an evaluation framework for information retrieval (IR) systems that focus on the effectiveness of query term expansion techniques. However, the quality of query term expansions are affected by several factors (e.g., IR system using expanded query, quality of initial query, etc.), so it is difficult to evaluate this technique. In this s...

متن کامل

NTCIR-5 Query Expansion Experiments using Term Dependence Models

This paper reports the results of our experiments performed for the Query Term Expansion Subtask, a subtask of the WEB Task, at the Fifth NTCIR Workshop, and the results of our further experiments. In this paper we mainly investigated: (i) the effectiveness of query formulation by composing or decomposing compound words and phrases of the Japanese language, which is based on a theoretical frame...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005